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^^ ' Abstract. We estimate density and regression functions for weak de- 

Cn J pendant datas. Using an exponential inequality obtained in |DePI and in 

$-H ■ |Mau2) , we control the deviation between the estimator and the function 

ft' itself. These results are applied to a large class of dynamical systems 

■^C , and lead to estimations of invariant densities and on the mapping itself. 

o 
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Dynamical systems are widely used by scientists to modalize complex 
systems ( jABSTj ). Therefore, estimating functions related to dynamical 
Q | systems is crucial. Of particular interest are : the invariant density, the 

mapping itself, the pressure function. We shall see that many dynamical 
systems have the same behavior as weak dependant processes (as defined in 
jDoLoj ). We obtain results of deviation for regression functions and den- 
sities for weak dependant processes and apply these results to dynamical 
systems. 

In |Mau2j we gave an estimation (with control of the deviation) of the pres- 
sure function for some expanding maps of the interval. Results on the esti- 
mation of the invariant density for the same kind of maps where obtained 
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£NJ ■ in jF] and stated in [Mac . In this later article, results on the estimation 

of the mapping were also stated. These last two papers dealt with conver- 
gence in quadratic mean. Our goal here is, on one hand, to consider more 

^^ ■ general dynamical systems : non uniformly hyperbolic maps on the interval, 



Ctf 



dynamics in higher dimension ... On the second hand, we obtain bounds on 
the deviation between the estimator and the regression function as well as 
almost sure convergence. Related results on the estimation of the regression 
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function may also be found in |FVj and |Masj where strongly mixing pro- 
cesses are considered and almost everywhere convergence and asymptotic 
K> , normality are proved. Our aim is to provide an estimation of the deviation 

between the estimator and the regression function for a larger class of mix- 
ing processes and for regression functions that may have singularities. 
Before giving the precise definitions and results, let us state our main results 
informally. We consider a weak dependant stationary process Xq, ..., Xi, ... 
taking values in E C I . Our condition of weak dependence is with respect 
to a Banach space C of bounded functions on £ (see Definition ^ below). Let 
(Yj)jgN be a stationary process taking values in K and satisfying a condition 
of weak dependence according with the process Xq, ..., Xi, ... Consider the 
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regression function r(x) = E(l$ | Xi = x), we shall assume some regularity 
on r (see Assumption |2] below). We shall also consider that the process 
(Xi)i £ f$ has a density / with respect to the Lebesgue measure m. f is ass- 
umed to have some regularity properties that allow localised singularities. 
Consider a non negative kernel K € C satisfying Assumption ^ and the esti- 
mators (introduced in [E] and used for example in [F] , |Maej , |FVj , |Mas| ) : 

i=0 v y i=0 v 

(0.1) r n {x) = j—. 

fn{X) 

Remark that f n ( x ) — implies g n { x ) = 0, in that case, we define r n (x) = 0. 

Theorem. Assume some weak dependence condition on Xi and Yi (see Def- 
initional, assume that inf/ > 0. There exists M > 0, L > 0, R > 0, 
< (3 < 1, D > 0, 7' > swc/i t/iai, outside a set of measure less than 
DK 1 ' , for all n £ N*, for all t € R + , for all u > Cth, 

F(\f n (x) - f(x)\ >t-u a )< 2ee expf-^M/t^n] if f is a regular 
if r is bounded and a-regular, 

P(|f n (x) -r(s)| > t-u a ) < e^ ^expf-^Ln/i^ 2 ] +exp[-L / n/ 1 /3+2 ]) , 

ifYi£ L°° and r is a-regular, 

P(|f„(x) - r(x)\ >t-u a ) < 2e L ? expf-^Ln/i^ 2 ]. 

As a consequence, provided h = h n goes to zero and nhn = 0(n e ), e > 0, 
we obtain the following convergences provided r (and f) are a-regular : 

• form almost all x £ X, f n (x) converges to f(x) andr n (x) converges 
to r(x) almost surely and in L p for any 1 < p, 

• El/ \f n {%) — f(x)\dx ) and E I / \r n (x) — r(x)\ J go to zero. 

• for almost all i£E, for a < \, \g n (x) — g(x)\ = Op(n~ a ) where g n 
is either f n or r n and g is either f or r. 

Using a "time inverted process" (as in |DeP| IMau2j ). we deduce an es- 
timation of the invariant density and of the mapping itself for dynamical 
systems. Consider a discrete dynamical system from a "good class" (see 
Definition 11.41 for precise settings), denote by T the mapping and / the in- 
variant density of interest. Pick Xq at random with stationary law /i = fm 
and let Xi = T 1 (Xq). Consider a kernel K and f n as above, let r 3 n be the 
estimator for Y? = X° i+l + e], j = 1, . . . , d, Xj is the jth coordinate of Xi 
and (e^)jgH are independent stationary process with mean, independent 
of (Xi)i^{. Let T n be an estimator of the map T which has coordinates rh. 
Let || \\d be the sup norm in M d , we have the following results. 

Corollary. Assume that the dynamical is in the good class. There exists 
M > 0, L > 0, R > 0, < j3 < 1, 7' > such that outside a set of measure 
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less than RK 1 , for all t € M + , for all u > Cth, 

(0.2) is f is regular P(|/„0) - f(x)\ > t - u a ) < 2e^ exp[-t 2 Mh^ +2 n] 

(0.3) F(\\f n (x)-T(x)\\ d >t-u a ) < 2e^exp[-t 2 Lh l3+2 n\. 

As a consequence, provided h = h n goes to zero and nhn = 0(n £ ), e > 0, 
we obtain the following convergences : 

• for m almost all x € X, T n (x) converges to T(x) almost surely and 
in L p for any 1 < p, the same holds for f n (x) provided that f is 
a -regular. 

• El/ ||T n (:r) — T(x)||d I go to zero, the same holds for f n (x) pro- 
vided that f is a-regular. 

• for almost all i£E, for a < \, \g n (x) — g(x)\ = Op(n~ a ) where g n 
is either f n or T n and g is either f (provided it is a-regular) or T. 

In a first section, we state our hypothesis on the process and the dynamical 
systems. We also give the precise results. 
The second section is devoted to the proofs. 

In the last section we provide examples of dynamical systems satisfying our 
hypothesis. We also provide some simulations. 

Contents 
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1. Hypothesis and statement of the results 

1.1. Weak dependence. As mentioned quickly in the introduction, we 
shall consider a class of weak mixing process with respect to a Banach space 
of bounded functions C. This functional definition of dependence is more 
general than strong mixing used in |Mas| and |FVj . We shall see that it 
encounters a large class of dynamical systems (see also |DePj for other ex- 
amples than dynamical systems). This kind of functional definition has 
been introduced in |DoLoj and |DePj for Lipschitz or BV functions (see also 
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[Mau2| ). 

For simplicity, let £ C M rf and (JQ)j £ n a process taking values in S. Our 
proofs could probably be extended to sets £ included in more general Banach 
spaces. In the following, || ||^ denotes the sup norm on W 1 : ||(xi, . . . , Xd)\\d = 
max(|xj|, i = 1, . . . ,d). 

Definition 1. Let C be a Banach space of bounded functions on E. We 
consider a norm on C of the form : 

llffllc = llfflU +C(g) 

where C(-) is a semi- norm on C and || • ||oo is the sup. norm on C. Let C\ 
be the "semi-ball" of functions g G C such that C(g) < 1. Let Mi be the 
er-algebra generated by Xq, . . . ,X{. The C- mixing coefficients are : 

$ c (n) = sup{\E(Zg(X i+n ))-E(Z)E(g(X l+n ))\ i G N, Z is 

(*) .A/fj — measurable and \\Z\\i < 1, g G Ci}. 

The process (Xj)j g N is C-weakly dependant if (<3?c(n)) ng N ^ s summable. 
Let 7j be a I 1 stationary process, taking values in M. Let A4i be the 
er-algebra generated by Xq,Yq, . . . ,Xi,Yi. We shall say that pQ)j e pj is C- 
weakly dependant with respect to (Yi)jgN if the coefficients : 

*C,(n) i6N (n) =sup{|E(Zy i+n g(X i+n )) - E(Z)E(y i+n g(X i+n ))| »6N, Z is 

(**) A^j - measurable and ||Z||i < 1, g G Ci}, 

are summable. 

Remark. Examples of spaces C that we shall consider are : 

• the space of function with bounded variations, in that case, C(g) is 
the total variation of g ; 

• the space of Lipschitz (resp. Holder) functions, in that case, C(g) is 
the Lipschitz (resp. Holder) constant ; 

• the space of C 1 function could also be considered, in that case, C(g) 
is the sup norm of g' . 

1.2. Regularity conditions. We now state the conditions on the kernel 
K, the regression function r and the density function /. 

Assumption 1. The kernel K is a nonnegative function in C (so it is 
bounded), with compact support D and with integral 1 : 



L 



Kdm = 1. 

E 



For h > and x G S, let Kh >x (t) = K (\-^)- We assume that there exists 
< < 1 such that C(K h , x ) < ^p-. 

Assumption 2. Consider a function g on X and < a < 1. Let B g (u,h) 
be the set of points x such that 

sup \g(x) - g(y)\ > u a . 

d(x,y)<h 

If g is a map from T, to E, B g (u, h) is the set of points x such that 

sup \\g{x) -g{y)\\d >u a . 

d(x,y)<h 
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For any decreasing to zero sequences (u n ) nS N and (/i n )ngN> let A n = B g (u n , h n 
and B N = p| \J A p . 

n>N p>n 

We shall say that g is a-regular if for any sequence (u n ) nG N decreasing 
to 0, for any sequence (/i n )neN with h n = o(u n ), there exist constants 
D(g),D'(g) > and 7,7' > such that m{B N ) < D(g)h? N and m(A N ) < 

D'(g)h%. 

All our results are proved under the assumption that r and/or / are 
a-regular for some < a < 1. 

Remarks. If g is a-H61der then it is a-regular with m(B g (u,h)) = for all 

u > H(g)h with H (g) the Holder constant of g. 

If g is Lipschitz onS\ {x} and discontinuous at x then m(B g (u, h)) < Ct/i 

for all u > L{g)h where L(g) is the Lipschitz constant of 5 on E \ {x}. 

Functions of bounded variations satisfy this condition (see section |3J). 

An a-regular function need not to be Holder everywhere but we control the 

points where this is not the case. 

1.3. Main result. Let us state our main result for C-weak dependant pro- 
cess. 

Theorem 1.1. Assume that (Aj)j g N is a stationary process absolutely con- 
tinuous and is C-weakly dependant with respect to (li)iGN, (^)ieN * s a sta- 
tionary process taking values in R. Let r be the regression function (r = 
~E(Yi\Xi)) and f, the density function, we assume that inf / > 0. Then there 
exists M > 0, L > 0, V > 0, R > 0, a set A U)h such that for all n£f, for 
all t € M + , for x ^ A u ^, for all u > h ■ Diam(D), m(A u f l ) < RK 1 , 
if f is a-regular, 

(1.1) P(|/n(z) - /(x)| > t - u a ) < 2ee exp[-£ 2 Mn/i /3+2 ], 

if r is a-regular and bounded, 
(1.2) 
P(|f„(x) -r(x)| > t-u a ) < eh f2exp[-t 2 Ln/i /3+2 ] + expf-L'n/j^ 2 



if r is a-regular and Yi £ L°°, 

(1.3) P(|f„(x) - r(x)| > t - u a ) < 2e^ expf-^Ln/i^ 2 ]. 

^4s a consequence, provided h = h n goes to zero and nhn = 0(n e ), e > 0, 
we obtain the following convergences : 

• form almost all x € X, r n (x) converges to r(x) almost surely and in 
LP for any 1 < p, the same holds for f n (x) provided f is a-regular. 



• El/ \r n (x) — r(x)\ \ goes to zero, the same holds for f n (x) provided 

f is a-regular. 

• for almost all x eE, for a < \, \r n (x) — r(x)\ = Of(n~ a ), the same 
holds for fn{x) provided f is a-regular. 
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1.4. Dynamical systems. We turn now to our main motivation : dynami- 
cal systems. Consider a dynamical system (E, T, fi). That is, T maps E into 
itself, fi is a T-invariant probability measure on E, absolutely continuous 
with respect to the Lebesgue measure m. We assume that the dynamical 
system satisfy the following mixing property : for all ip 6 L l (fi), ip E C, 



(1.4) 



<$(n)|M|i|H| c , 



tp ■ ip o T n d\i — I tpdfi / yd/i 
s s s 

with $(n) summable. 

Definition 2. A dynamical system (E,T, /i) is of the good class if 

(1) E is a bounded subset of M. d , 

(2) the map T : E — ► E is a-regular, 

(3) the invariant density / (fj,= fm) verify inf / > 0, 

(4) it satisfy the mixing property (|1.4|) . 

We shall also assume that the Banach space C is regular in the sense that 
for any ip £ C, there exists R(ip) £ M. such that 

(1.5) ||p + l*(¥>)||oo<C(p). 

We consider the stationary process taking values in E : Xj = T l with law 
|U and Yi = Xi+i + £* where the £j are L 1 independent random vectors, 
with independent coordinates, identically distributed random variables with 
mean and independent on (Xj)j g pj. Then T is the regression function 
T{x) = E(Yj|JQ = x). Let r%, is the estimator for Y? = X 3 i+l +e^, defined by 
(|t).l|) . j = 1, . . . , d, Xj is the jth coordinate of Xi and e\ is the jth coordiante 
of Ej. We shall denote T n the estimator of T which has coordinates r 3 n . 
Theorem 1 1 . 1 1 applied in the context of dynamical systems gives the following 
result. 

Corollary 1.2. Let T be a dynamical system of the good class, assume 
that the Banach space C is regular. There exists M > 0, L > 0, R > 0, 
< (3 < 1, 7' > such that outside a set of measure less than Rh J , for all 
t£l+, for allu > Cth, 

P(||f n (x) - T(x)\\ d >t-u a ) < 2de l e exv[-t 2 LhP +2 n] 

P(|/n(aO - f{x)\ >t-u a ) < 2e« exp[-t 2 Mh^ +2 n] if f is a-regular. 

As a consequence, provided h = h n goes to zero and nhn = 0(n e ), e > 0, 
we obtain the following convergences : 

• form almost all T n {x) converges to T(x) iEE, f n (%) converges to 
f{x) almost surely and in LP for any 1 < p, if f is a-regular, f n (x) 
converges to f(x) almost surely and in LP for any 1 < p, 

• El/ ||T n (x) —T(x)\\d J go to zero, if f is a-regular, 

E ( / \fn(x) - f{x)\dx\ go to zero. 

• for almost all i£E, for a < \, \g n (x) — g(x)\ = Op(n _a ) where g n 
is either f n or T n and g is either f (provided f is a-regular) or T. 
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2. Proofs 

Let us now prove the results stated in the above section. The main in- 
gredient is an exponential inequality that has been obtained in |DePj in the 
case C = BV, and in |Mau2j for more general spaces C. 

2.1. Main ingredient : exponential inequality. 

Proposition 2.1. Let (Xj)j S n be a C-weakly mixing process. Let the coeffi- 
cients &c(k) be defined by (0). For <p G C, p > 2, define 



Sn{<-p) =^2<f( x i) 



i=l 
and 

\fc=0 / 

For any p > 2, we have the inequality : 



\S n {<p) -E(5„M)|| P < 2^(0)^6^ 



i 

n— 1 \ 2 



(2.1) <C(ip)[2p^2(n-k)^ c (k) 

\ k=0 / 

As a consequence, we obtain 

F(\S n (<p)-E(S n (<p))\>t) 



(2.2) < ee exp 



-t 2 



2e(C(^)^c(0)E n kZl ) (n-k)^c(k) 



Applying Proposition 12.11 to the function (f> = K will provide an expo- 
nential inequality for f n . The proof of Proposition 12.11 mav be found in 
|Mau2j (Proposition 1.1), see also |DePj for a version of this proposition 
with C = BV . In order to have an exponential inequality for g n , we need 
the following result. 

Proposition 2.2. Let (Xj)j S n be a C-weakly mixing process with respect to 
(Yi)i € ?$. Let the coefficients &c,(Yi) ieN {k) be defined by l**\) . For ip G C, 
p>2, define 

n 

s n (<p) = J2 Y M x i) 

and 

Kn = *C(0),(Yi) i&l ( Y2®C,(Y lh e,( k ) ) II^M " E(W*0)llf C(<p). 
\k=0 / 
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For any p > 2, we have the inequality : 



\S n (<p)-E(S n (<p))\\ p < \2pJ2hr 



\ i=i / 

(n~\ \ 2 

k=o J 

As a consequence, we obtain 

F(\S n {<p)-E(S n {<p))\>t) 

(2.4) < e« exp | = | . 

Proof. Proposition 12 . 21 is proved as Proposition 1.1 in Mau2 using the vari- 
able Zi = Ynp(Xi) - E(Yi<p(Xi)) for which we control ||E(Z fc |.M i )|| 00 with 
the coefficients ®c,(Yi) ieN (k — i), for k > i. D 

2.2. Proof of Theorem ll.il As already mentioned, the proof of Theorem 

II .11 uses Propositions 12. ll and 12.21 applied to (p = K . This gives a control on 

the deviation between f n (x) (resp. gii(x)) and E(/ n (x)) (resp. K(g n (x))). 

Then it remains to see to that E(/ n (x)) is close to f(x). In order to obtain 

the result for the regression function, we deduce from the previous results 

^ KCg n (x)) 

a control on the deviation between r n (x) and — -^ , then we prove that 

E(/n(x)) 

— -^r is close to r(x). To this aim, we begin with two lemmas. 

Hfn(x)) 

Lemma 2.3. We assume that the density function f is a-regular. For all 
u > hdiam(D) =: k, if x Bf(u,k) we have : 

\nTn(x))-f(x)\<u a . 

Proof. We have : 



nTn(x)) = [ K{y)f{ 
Jd 



x — hy)dm(y). 



If x ^ Bf(u, k) then for y € D, 

\f(x)-f(x-hy)\<u a . 

The result follows from the fact that the integral of K is 1 . □ 

Lemma 2.4. We assume that the regression function r is a-regular. For 
all u > hdiam(D) =: k, if x B r (u, k) we have : 

E(?n(x)) 



r(x) 



<u c 



E(/n(x)) 

Proof. We have : 

RCgjx)) ' r ( x -y h ) K ( x )f( x -y h ) dm (y) 



D 



E (fn( x )) / K(x)f{x-yh)dm{y) 



D 
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If x B r (u, k) then for y £ D, 

\r(x) — r(x — hy)\ < u a 
and the result follows. □ 

Proof of Theorem M.ll Proposition 12 . II applied to (p(t) = K (^-^) gives : for 
all t > 0, for all n £ N, for all i£E, 



P(|/ n (x)-E(/ n (x))| >t)<eeexp 



t 2 nh?+ 2 \ 



2eR$c{0)C(K)J ' 
where R is the smallest positive number such that 

n-\ 

^(n-fc)$c(fc) < Rn- 
fc=o 

This together with Lemma 12 . 31 gives (|l.lj) for x Bf(u, k) (k = hdiam(D)). 
In order to obtain the estimation for the regression function r, we apply 
Proposition O to <p(t) = K (^). We obtain : 



F(\g n (x)-E(g n (x)\ > t) < e~ exp 



*W+ 2 



2ei?'$ c , (yi)ieN (0)C(K) 

where R' is the smallest positive number such that 

71-1 

Y,(n-k)<S> CXYih€N (k)<R'n. 



k=0 



Now, for any t > 0, 
E(£„(s)) 



r n (x) 



E(/n(x)) 



P( |5„(x)-E(5 n (x)| >-E(/ n (x 



+ 



|/ n (x) -E(/ n (x)| > -E(/ n (x)f n (x)" 1 



Now, assume that Y{ 6 L°°, we remark that |r n (x)| < ||^||oo =: Umax thus : 



f n (x) 



E@„(x)) 



E(/n(x)) 



>* 



+ 



|# n (x) -E(g n (x))\ > -inf/ 
|/ n (x)-E(/ n (x))|>|j^ 



Propositions 12. II and 12.21 give : 



r n [x 



E(9n(x)) 



where 



C = max 



E(/n(x)) 

(inf/) 2 



>t < 2e^ exp[-CiV +2 rj] 



(inf/) 5 



8ei2'$ Ci(yi)!gN (0)C(^) ' 8ei*M0)C(tf)s& 
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Lemma 12.41 gives (|1.3(l for x B r (u,k). Let A^ = Bf(u,k) U B r (u,k), the 
first part of Theorem 11.11 is now proved for x ^ A^ if Y{ € L°° . 
If we don't assume Yj € L°°, but r is bounded by r max , we write : 



r n (x) 



E(?„(x)) 



E(/n(x)) 



>t < 



5 n (cc) - E(ff n (x))| > -,/W(V 



+P( \f n (x)-K(f n (x))\>-f n ( X ) 



E(g n (a)) 

E(/n(x)) 



< 



5n(x) -E(5n(x))| > £inf/) +Pf|/ n (x) -E(/ n (x))| > t -^- 
4 / V 4 r maY 



+P /„(*) < 



inf/ 



Propositions 12. II and 12.21 give : 
m [.^ , x E(o n (x)). \ i 

V E(/n(x)) J ' 



2exp[-CW +2 n] + exp[-C"W +2 



where C and C" may be expressed in terms of inf/, r max , i?, JR',<l>c(0), 



* 



C,(Yi)i 



(0), C(ET). Then JT2J) follows as above for x £ A h . 



Let us conclude with the almost everywhere and LP convergence, for all 
1 < p. Let us begin with the convergence in L p (m). We fix 1 < p. We 
remark that since the kernel K is bounded, so is f n : sup(/ n ) < ^ — . Also, 
if x Bf(u,e) then f(x) < ■^-f-. Let E tyU (x) be the event 

E ttU (x) = {\f n (x)-f(x)\>t-u a }, 
we have : 

\f n (x)-f(x)\PdF = f \f n (x)-f(x)\PdF 

JE t)U (x) 



'{Et,u(x)] c 

l + u a . sup K V 



< ¥(E ttU (x)) 
Ct 



+ 



diam{D)h h 



+ (t-u 



a\p 



< — exp[-CiVn] + (t- u a ) p . 



Take u such that u a = ^, i = 2u a and h = h n = 0{-^) with < £ < -X^. 
Then for x A n = Bf(u n ,h n ), there exists < k < 1 and constants 
Ci,C 2 >0 such that : 



||/„(s)-/(x)E<Cie- 



-C 2 n K 



+ 



1 

Inn 



Thus, \\fn(x) - f(x)\\ p goes to zero for x B N = n n > N U p >„ A p and 
m(-Bjv) = 0(hjf). We conclude that for almost all x 6 E, / n (x) goes to 
/(x) in L p (m). 
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The proof of the almost everywhere convergence is in the same spirit. Con- 
sider a sequence t m decreasing to 0, for x € E, 

,(*) -h /(*)}) < lim lim y t m\f n (x)-f(x)\>t m }). 

m— »oo N^oo L — ' 
n>N 



lim V P ({l/n(^) - /0*0I > W) = 

A— >QO ' — * 



Take h = h n = O(^) with < £ < ^ then 

E 

n>N 

provided t m > u a with u a > Cth n and x g - -B u ,h n , we take u = u n = ^- and 

A n = B Un;hn , for x £ (1 N U n > N A n =: B, f n (x) goes to f(x) and m(B) = 0. 
The proofs of L p (m) and almost everywhere convergence for r n follow the 
same lines. 

Finally, to prove the bound for the convergence in probability, recall that 
u n = Op(n~ a ) if and only if 

lim limsupP(n a |u n | > M) = 0. 

Following the same lines as above, we prove that for g either / or r and g n 
either f n or ? n , 

lim limsupP(n a |<? n (:r) - g(x)\ > M) = 

A-/— »oo n ^oo 

for a < 5, x B = fl n >o U p > n A p and m(B) =0. □ 

Let us show how we can apply the above result to dynamical systems. 

2.3. Dynamical systems and time reversed process. It turns out that, 
in general, the process (Xj)j € pj associated to a dynamical system (E, T, fj,) 
is not weakly dependent. Nevertheless the condition of weak dependence 
is satisfied for a "time reversed process" whose law is the same as (Xj)j 6 ^. 
Indeed, Condition 11.41 together with the fact that the space C is regular 
(recall (|1.5|) ) gives (see |DePj or |Mau2j for the details) : for all i £ N, for 
ip £ C\, (p € L 1 , \\cp\\i < 1, 

(2.5) jCov^PQ), <p(X n+i ))\ < 2$(n). 

So that, if we consider the process (Xj)j € pj defined by 

(X , . . . ,Xn) L = W (X n , ...,X ) V?i G N, 

it is C- weakly dependent. 

Recall that Y{ = Xi + \ + E{ with (ej)j g pj independent random vectors, with 
independent coordinates, of zero mean, which is also independent of (Xj)j e ^. 
In order to estimate the regression function E(3^|Xj) = T, we shall estimate 
separately each coordinates of Xi + \. Let 

n-\ 



aw - kpt*^) 



i=0 
-I n— 1 / ir \ -. ra— 1 



^ v ^(^) + _l t4K^) 



z=0 i=0 
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Equation (|2.5j) implies also that (Xj)j G n is C-weakly dependant with respect 
to each (Xj +1 )i^, and also to each (ej), j = 1, . . . ,d. We obtain inequalities 

(|1.3j) separately for the two parts of g%.(x) and then the two parts of r J n (x). 
Then we deduce Corollary 11.21 

3. Examples and simulations 

We shall now give some examples of discrete dynamical systems satisfying 
(12.511 . such that T is a-regular, admits a unique invariant density (with 
respect to the Lebesgue measure), / which may be also a-regular. 
There is a large class of dynamical systems satisfying (|2.5(l . we refer to 
the literature on dynamical systems : |Yll IY21 IMaull ILH IColl IBuMaull 
IBuMau2l ILiSVl IBroj and many other, see |Ba| for a review on these topics. 
Below we consider Lasota-Yorke maps, unimodal maps, piecewise expanding 
maps in higher dimension. Our results should also apply for hyperbolic maps 
but a more intricate study on the invariant density is required. 

3.1. In dimension one : piecewise expanding maps. We shall consider 
piecewise expanding maps or "Lasota-Yorke" maps. Consider / = [0,1], 
partitioned into subintervals Ij. On the interior of the ij's, the map T is 
C 2 , uniformly expanding \T'\ > A > 1, and continuous on the closure of the 
ij's. Under additional conditions of mixing or covering (see below), it is well 
known ( |Brol ILiSVl FLU IColj ) that T admits a unique absolutely continuous 
invariant measure whose density / belongs to the space BV of functions of 
bounded variations. 

Before we give precisely the hypothesis on the map, we prove that functions 
of bounded variations are regular. We shall denote m the Lebesgue measure 
on /. 
Recall that a function g on I belongs to BV if 

n 

V a = swp^2\g(oi) -g(a i+ i)\ < oo 

i=0 

where the supremum is taken over all the finite subdivisions of / ; ao = < 
ai < • • • < a n+ \ = 1, then V g is the total variation of g. 

Lemma 3.1. Let g belongs to BV. Then for any sequences (u n ) n ^ and 
(/i n ) nS N decreasing to zero with h^~ a < u 2 n , < a < 2, let B g (u,h) be the 
set of points x such that 

sup \g(x) - g{y)\ > u, 

d(x,y)<h 

A n = B g (u n ,h n ) and B N = Q [J A p . 

n>N p>n 



Then 



m(B N ) < 3(\Jg)hl and m(A n ) < 2(\/ g)^ 

Ur 



'"It 

Proof. Let xq = inf Bjy and x\ such that < x\ — x$ < hjy and x\ G B^. 
Then x\ belongs to A p for infinitely many p > N. In particular, there 
exists p\ > N and X2 with \x\ — X2I < h Pl and \g(xi) — g(x2)\ > u Pl . Set 
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xq = a?i,0) x i = 371,1 and rr 2 = £i, 2 . We construct sequences (maybe finite) 
(xi t o,Xi,i,Xi,2) and (pi) such that : 

(1) pi > N for all i > 1, 

(2) Xi t o < mm(x it i,x it 2) ; Zi+1,0 > max(xi i i,Xi i2 ), a^i G A Pi , 

(3) |iCi,o ~~ x i,i\ < ^Pi-i f° r all i > 1, with the convention that /i po = /ijv, 

(4) |x i; i - Xj >2 | < h Pi for all i > 1, 

(5) 13(^1) - g(x i>2 )\ > u Pi for all i > 1, 

(6) -Bat C Ui>i J»with Jj = [a i ,&i+/ip i _ 1 ], [a^] = [a^ca^U^!,^]. 

If x^o^i.i, %i,2 are already constructed, let Xj+^o = inf[x G i?7v ; x > 
max(xj j i,Xj j 2) + /i p J. Now, there exists £j+i,i G -Bat with < ajj+i,i — 
Xi+i,o < /ipi- Since Xj+i,i G Bat, there exists pi+\ > Pj and Xj + i i2 with 
|x;+i,i - a;i + i )2 | < h Pi+1 and ^(xj+^i) - #(xj + i )2 )| > u Pi+1 . In that way, all 

i+i 
the points of Bat smaller than max(xj + i i2 ,Xj + i j i) + /i Pi+1 are in M Jj. The 

i=i 
construction stops when {x G i?/v, x > max(ij ) i, s^) + /i Pi } is empty. 

As a consequence of this construction, we get : 

/ u pi <\/9 remark that the Jj's are disjoint, 
i>l 

m(B N ) < sYlh Pi . 

i>0 

Now, using Cauchy-Schwartz inequality, 



i>0 



< 




3 • fojy recall that /i^~ a < u n . 



Finally, we have proven : m(B]y) < 3(\J g)hj^. 

The estimation on the measure of A n is much simpler. We may cover A n by 
balls of radius 2/i n , centered on points x such that snp^ xy \ <hn \g(x)—g(y)\ > 
u n . Then, m{A n ) < 2 • k ■ h n where k is the number of such balls. Now, 

\J g > k ■ u n , thus k < . □ 

3.1.1. Finite number of pieces. This subsection is devoted to Lasota-Yorke 
maps with a finite number of intervals of monotonicity. The main differences 
between infinite and finite pieces are some additional technical assumptions 
in the infinite case. 
We consider the following conditions that may be found in jColl iLij . 

Assumption 3. 
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(1) The partition (Ii)j=i ... r of I is finite, denote by Vk the partition of 
invertibility of T k . 

(2) T satisfies the covering property : for all k € N, there exists N(k) 
such that for all P £ Vk, 

T N Wp = [0,1]. 

(3) On each Ij, the map T is C 2 . 

The following result is standard and may be found in f [CJoll [Lil IF3aj ) . 

Theorem 3.2. Under Assumptions^ the map T admits a unique absolutely 
invariant probability measure, its density f belongs to BV and inf / > 0. 
Moreover, if \i = fm is this invariant probability measure, then jl.4\) is 
satisfied with C = BV and $(n) = j n , < 7 < 1 : there exists < 7 < 1, 
C > 0, such that for any ip € BV and (p £ L 1 (/x), for any nGN, 



ip ■ if o T n d[i — / ipdfj, / (/9(i/.i 
7 11 



<c 7 n IM|i||Vllw- 



We already know that / is regular by Lemma 13. II In order to have the 
conditions of Corollary 11.21 it remains to prove that T is regular. This is 
indeed clear because T is piecewise C 2 , with finitely many points of disconti- 
nuity. As a consequence, for any sequences (u n ) ng N and (/i n )neN decreasing 



to zero with hw < U 1L,\ , 

ly — sup I T' I ' 

V 

m(A N ) < } \gi\ ■ h N 

i=l 

where we have denoted by xi, i = 1, . . . ,p the points of discontinuity, and 
gi the gap at Xi : g. t = \T(x^) — T(xf)\. Moreover, as soon as u^ < inf gi, 
we have that A n D A n+ i, so Bn = An and we have the required control on 
the measure of An- 
Thus, Corollary 11.21 applies and we get the following result. 

Corollary 3.3. Let T satisfies Assumption^ let K be a Kernel belonging 
to BV , let /jv and T/v be the estimators of f and T . For allO < a < 1, here 
exists M > 0, L > 0, R > 0, such that outside a set of measure less than 
Rh a , for all teM + , for all u > Cth, 

(3.1) m?n(x) - f{x)\ >t-u a )< 2e- exp[-t 2 Mn/i 2 ] 

(3.2) P(|T n (x) - T(x)\ >t-u a )< 2e= exp[-t 2 Lnh 2 }. 

As a consequence, provided h = h n goes to zero and nh^ = 0(n £ ), e > 0, 
we obtain the following convergences : 

• form almost all x € E, f n { x ) converges to f(x) andT n (x) converges 
to T(x) almost surely and in LP for any 1 < p, 

• El/ \fn(x) — f(x)\dx J and E I / \T n (x) — T{x)\ I go to zero. 

• for almost all x € E, for a < g, \9n{x) — g(x)\ = Op(n _a ) where g n 
is either f n or T n and g is either f or T. 
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Proof. We apply Corollary 11.21 Since the Kernel K belongs to BV, we may 
take (3 = 0. Also, with our hypothesis, T and / are in BV so they are 
regular according to Lemma 13. II □ 

Simulations for /3-maps. For (3 > 1, the /5-map is Tr(x) = fix mod 1. This 
map has been widely studied. It is well known that it satisfies Assumptions 
13.31 It is the simplest example of a non-markov map on the interval. There 
is an explicit formula for the invariant density (see |Taj ) : 



Mx) 



CJ2/3- 



(<+!)■ 



[0,TH](x)i 



where C is a constant chosen so that fp has integral 1. 
We have performed some simulations for several values of of /3, different 
Kernels and for several noises e,. These simulations are summarized in the 
following table. In this table, AMEf means absolute mean error for the 
density i.e. 



AMEf = - 



V 



fe=i 



\fn{Xk) ~ f(Xk)\, 



where the x^ are p points regularly espaced on I on which we have calculate 
f n . Below, p = 200. AMET means absolute mean error for the map T (same 
formulae). For the Kernel, P2 is the degree 2 polynomial : |(1 — x 2 ). 



n 


h 


AMEf 


AMET 


P 


kernel 


noise, gj 


10 4 


0.01 


0.08234419 


0.008309136 


27 
11 


P2 


no, £i = 


10 4 


0.005 


0.09906515 


0.004301326 


27 

11 


P2 


no, £j = 


5-10 4 


0.007 


0.04428149 


0.005530895 


■27 
11 


P2 


no, £i = 


2-10 5 


0.001 


0.05107575 


0.001799785 


'27 
11 


P2 


no, Si = 


5-10 4 


0.007 


0.05492035 


0.003809815 


27 
11 


\-U\ 


no, Ei = 


5-10 4 


0.007 


0.04728425 


0.008303824 


■27 
11 


P2 


W[-0.3,0.3] 


2-10 5 


0.0005 


0.07806642 


0.011328519 


27 

11 


P2 


W[-0.3,0.3] 


10 4 


0.01 


0.07473928 


0.020744986 


27 
11 


P2 


Af (0,0.3) 


5-10 4 


0.007 


0.04269281 


0.011423138 


27 

11 


P2 


A/"(0,0.3) 


2-10 5 


0.001 


0.05107575 


0.001799785 


27 

11 


P2 


AT(0,0.3) 


5-10 4 


0.007 


0.05329131 


0.007722570 


27 
11 


lf_l II 


W[-0.3,0.3] 


10 4 


0.01 


0.08165332 


1.648713e - 02 


46 

11 


P 2 


no, Ei = 


5-10 4 


0.007 


0.04259507 


1.071092e - 02 


46 
11 


P 2 


no, Ei = 


2-10 5 


0.001 


0.05249840 


1.536396e - 04 


4b 
11 


P2 


no, Ei = 


5-10 4 


0.007 


0.03810643 


1.482175e - 02 


46' 
11 


P 2 


W[-0.3,0.3] 


5-10 4 


0.007 


0.03961733 


1.763502e - 02 


46 
11 


P2 


Af (0,0.3) 


5-10 4 


0.007 


0.05467709 


7.079913e - 03 


46 
11 


\-U\ 


no 


5-10 4 


0.007 


0.05109682 


1.036748e - 02 


46 
11 


\-U\ 


W[-0.3,0.3] 



Remark. It seems that there no real influence of the kind of Kernel nor on 
the noise. What seems more surprising is the fact that the estimators for T 
are much better than for /. Also, remark that it seems that the best h n is 
not the same for / and for T. 



ic; 
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The graphics below correspond to (3 = jj, K = 1, 1 1,, e« -~> U[— 0.2,0.2], 
n = 50000 and h = 0.007. 



density 



Estimator for x -> 27/1 1 x mod 1 





0.0 0.2 0.4 0.6 



0.2 0.4 0.6 0.8 



difference for the density 



difference for T 





0.0 0.2 0.4 0.6 



3.1.2. Infinite number of pieces. If the number of pieces is infinite, their are 
many different settings leading to existence and uniqueness of an absolutely 
continuous invariant probability measure which is exponentially mixing. Let 
us cite |Brol ILiSVj . We will consider the conditions of Liverani-Saussol- 
Vaienti |LiSVj in a restricted case (our potential is |T'| _1 , they consider 
more general potential). These conditions are the following. 

Assumption 4. (1) There is a countable partition (7j)j g pj of / into in- 
tervals. On each Ij, the map T is monotonic and C 2 , it is continuous 
on each Ij. Denote by V the partition (Ij)j^ and by Vk the parti- 
tion of monotonicity of T 

(2) 



1 



— — - € BV and > sup — 

\T \ 4 \T 



< oo. 



(3) The partition V is generating. 

(4) The map T is covering, which means, in the infinite case, that for 
any n G N, for any P G V n , I may covered by a finite number of 
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subintervals of T N P : 

Vn G N, VP G P n , 3N, 3 finite J C V N \j '{P} / \J T N Q = X. 

QeJ 



17 



Remark. The condition — — G BV is satisfied provided that T has the 
bounded distortion property : 

\T"(x)\ 
sup sup 2 < oo 

PGPxGP \T'{X)\ Z 



and that N sup — — < oo. 



PGP 



The following result has been proved in a general setting in |LiSVj and in 
IBrol under the condition of bounded distortion. 



Theorem 3.4. Under Assumptions^ the map T admits a unique absolutely 
invariant probability measure, its density f belongs to BV and inf / > 0. 
Moreover, if \i = fm is this invariant probability measure, then jl.4[ ) is 
satisfied with C = BV and <£(n) = r f l , < 7 < 1 : there exists < 7 < 1, 
C > 0, such that for any ip G BV and (p G L l ([i), for any n G N, 



/ 



ip ■ if o T n dfi 



fitoj 



ipdfi 



<C7 n IMIilMlflv. 



Lemma 3.5. The map T satisfying Assumptions^ is regul 



ar. 



Idea of the proof. This is a simple consequence of the fact that T is piecewise 
C 2 (thus piecewise C 1 ) and that \, SU P 77777 < °°- ^ 

PGP ' ' 

There are several natural examples of dynamical systems satisfying As- 
sumptions 0J Let us cite the Gauss map : 



T(x) 



1 
x 



1 




1 






— 


— 


— 


+ 1- 


- a 


X 




X 







which appears in the continuous fractions decomposition ( |Broj ) and in anal- 
ysis a gcd algorithms (0). An natural extension of these systems are 
"Japanese systems" or a-Gauss maps : for < a < 1, 

T a {x) = - 
x 

T a maps the interval [a — 1, a] into itself. See |BoDaVJ for a description of 
these systems as well as an application to analysis of generalized Euclidian 
algorithms. The maps T a satisfy Asumption 0] for < a < 1. 

Corollary 3.6. Let T satisfies Assumption^ let K be a Kernel belonging 
to BV , let /jv and T/v be the estimators of f and T . For all < a < 1, here 
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exists M > 0, L > 0, R > 0, such that outside a set of measure less than 
Rh a , for all teR+, for all u > Cth, 



(3.3) 
(3.4) 



W(\fn(x) ~ f(x)\ >t-u a )< 2e- exp[-t 2 Mnh 2 } 
P(|T n (x) - T(x)\ >t-u a ) < 2e= exp[-t 2 Lnh 2 }. 



As a consequence, provided h = h n goes to zero and nh\ = 0(n £ ), e > 0, 
we obtain the following convergences : 

• form almost all x E /, fn{x) converges to f{x) andT n (x) converges 
to T(x) almost surely and in LP for any 1 < p, 

• El/ \fn(x) — f(x)\dx J and E I / \T n (x) — T(x)\ J go to zero. 

• for almost all x a I, for a < 2, \dn(x) — g(x)\ = Op(n~ a ) where ~g n 
is either f n or T n and g is either f or T. 

The following graphics are for the Gauss map T(x) = - (mod 1) for which 
it is known that the unique invariant probability density is f(x) = j-j^ jj—. 
We have performed simulations with n = 50000, h = 0.009 and the noise 
Si ^ W[-0.2, 0.2]. Over 800 points, we get EMEf = 0.05046933, AMET = 
0.05141938, if we restrict to points in [0.2, 1] we get AMET = 0.01439787. 



density 




x -> 1/x mod 1 



0.0 0.2 0.4 



0.2 0.4 o.e 



difference for the density 



difference for T 




0.0 0.2 0.4 0.6 



0.2 0.4 OJ 
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3.2. In dimension one : unimodal maps. Let I = [—1,1] and /:/—►/ 

be a C 2 unimodal map (i.e., / is increasing on [—1,0], decreasing on [0, 1]) 
satisfying /"(0) ^ 0, and, 

(HI) There are < a < 1, K > 1, and A < A < 4 with e 2a < A, and 

sup 7 | /'| < \ K < 8 so that 

W l(/ n )'(/(0))| > \ n for all n G N and A = lim^ | (/™)'(/(0)) I 1 /". 

(ii) |/™(0)| > e~ an , for all n> 1. 

(H2) For each small enough 5 > 0, there is M = M{5) G N+ for which 

(i) If x,...,/*^ 1 ^) (-5,5) then |(/ M )'(x)| > A M ; (ii) For each n, if 

x,...,/"- 1 ^) £ (-5,5) and/ n (z) G (-5,5), then |(/ n )'(x)| > ~\ n . 

(H3) / is topologically mixing on [/ 2 (0), /(0)], that is for any two open sets 

U, V C I, there exists JVeff such that Vn > AT, T~ n U n V / 0. 

Examples of unimodal maps satisfying (H1)-(H3) are quadratic maps 
1 — a ■ x 2 for a positive measure set of parameters a. (See e.g. |BeCa| ). 
The following theorem is obtained in two steps. First, it is proven that 
unimodal maps satisfying (H1)-(H3) are conjugated to a kind of hyperbolic 
markov maps called Young towers (see |Y1| IKeNoj ) . Then the estimation on 
the speed of mixing is obtained on the tower ( |BuMaul| ) . Let us emphasize 
the fact that the kind a mixing we need (namely II. 4J) and especially the fact 
that ||y||i appears, is not obtained in |Y1| IKeNo] (a form with the ||</?||oo 
was obtained there). 

Theorem 3.7. Let T be a unimodal map satisfying (H1)-(H3). The map 
T admits a unique absolutely invariant probability measure with density f 
satisfying inf / > 0. Moreover, if u = fm is this invariant probability 

if 2 (0)J(0)] 

measure, then \l-4\) is satisfied with C the space Lip of Lipschitz functions 
on I and <J>(n) = 7™, < 7 < 1 : there exists < 7 < 1, C > 0, such that 
for any ip G Lip and if G L l {[i), for any n G N, 

<C7 n |M|i||V|lLip- 



tp ■ tp o T djj, — I ipdn I <pdfx 



This is clear that T is 1-regular : for well chosen e n and u n , the sets 
An and Bjy are empty because T is C 2 . Nevertheless, it is known that the 
invariant density is very irregular (see |Y2I IKeNoj ) . With a more intricate 
study, we should probably prove that / is also regular. Here, we restrict 
ourselves to the estimation of T. 

Since the invariant measure has its support in S = [/ 2 (0), /(0)], our esti- 
mates are valid only for x G S. 

Corollary 3.8. Let T be a unimodal map satisfying (H1)-(H3), let K be a 
Kernel belonging to Lip ; let T/v be the estimator ofT. There exists M > 0, 
L > 0, R > 0, such that outside a subset of S, of measure less than Rh, for 
allt G M + , for all u > Cth, 

P(|T„(x) - T(x)\ > t - u) < 2ee exp[-t 2 Lh 2 ?i\. 

As a consequence, provided h = h n goes to zero and nh 2 ^ = 0(n £ ), e > 0, 
we obtain the following convergences : 
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for m almost all x € S, T n (x) converges to T{x) almost surely and 
in IP for any 1 < p, 



\T n (x) —T(x)\ I go to zero. 



E 



• for almost all x 6 S, for a < ^, \T n (x) — T(x)\ = Op(n a ). 

The following graphics are for the map T(x) = 3.8 * x * (1 — x). We 
have performed simulations with n = 50000, h = 0.01 and the noise e, ~> 
W[-0.2,0.2]. Over 154 points in 5, we get AMET = 0.004114143. 



estimator for x -> 3.8*x (1-x) 




difference for T 




3.3. Piecewise expanding maps in higher dimension. These are gen- 
eralisations of Lasota-Yorke maps in higher dimension. These maps have 
been studied in [Bull IBuPaSl IEuTI lBu2l lBu5l IC^wl IGoBol ISall] from var- 
ious point of view. The control of the speed of mixing may be found in 
|BuMauTl IBuMau2] , the strategy is as for unimodal maps : the map is con- 
jugated to a "Young tower". 

The setting is the following. (X, Z, T) will be a piecewise invertible map, 
i.e.: 



• X = U ZeZ Z is a locally connected compact subset of R . 

• Z is a finite collection of pairwise disjoint, bounded and open subsets 
of X, each with a non-empty boundary. Let Y = IJzg^ ^ 

• T : Y — > X is a map such that each restriction T\Z, Z £ Z, coincides 
with the restriction of a homeomorphism Tz : U — * V with U, V open 
sets such that U D Z,V D T(Z). 

T will be assumed to be non- contracting, i.e., such that for all x,y in the 
same element Z £ Z, d(Tx,Ty) > d{x,y). Also Z will be assumed to 
be generating, i.e., linv^oo diam(Z n ) = where Z n denotes the set of n- 
cylinders, i.e., the non-empty sets of the form: 

LV..^n-i]:=A)n---nT-"+ 1 A n _ 1 

for Aq, . . . , A n —\ € Z. Finally the boundary of the partition, dZ = \J ZeZ dZ, 

will play an important role in our analysis. In particular, we shall assume 
"small boundary pressure" (see below), a fundamental condition which al- 
ready appeared in |Bu4l IBu51 IBuPaSj . To formulate the crucial "small 
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boundary pressure" condition, we need first some definitions. 
The topological pressure |DGSj of a subset S of X is: 



P(S,T) = lim sup - log Yl 9 {n) ( A ) 



n 

Aez n 

AnS^H 

where g {n) {A) = sup xeA g(x)g(Tx) . . . g(T n ~ l x), g= (detT'l" 1 . 
The small boundary pressure condition is: 

P{dZ,T) <P{X,T). 

This inequality is satisfied in many cases. In particular, if T is expanding 

and X is a Riemannian manifold and the weight is | detT'(x)| _1 or close to 
it, then it is satisfied: (i) in dimension 1, in all cases; (ii) in dimension 2, if 
T is piecewise real analytic |Bu51 ITsT] : (iii) in arbitrary dimension, for all 
piecewise affine T |Ts2j or for generic T |Bu31 ICowj . 

A basic example is given by the multidimensional (3 -transformations |Bulj . 
i.e., maps T : [0, l] d — > [0, l] d , T(x) = B(x) mod 7L d with B an expanding 
affine map on M, d . Let us summarize our hypothesis on T. 

Assumption 5. Let (X,Z,T,g) be a weighted piecewise invertible dynam- 
ical system. Assume that: 

• T is expanding, i.e., there is some A > 1 such that for all x,y in the 
same element of Z, d(Tx, Ty) > A • d(x, y); 

• g = | det T'\~ l is Holder continuous with exponent 7 and is positively 
lower bounded. 

• the boundary pressure is small: P(dZ,T) < P(X,T). 

• T is topologically mixing. 

Let K(f) = m&xz^z sup^-^g^ dixv'yy where 7 is some Holder exponent 
of g- 

Theorem 3.9. |BuMauT| Let T satisfy Assumption Then, T admits 
a unique invariant measure fi, absolutely continuous w.r.t. the Lebesgue 
meausre m. This measure is exponentially mixing : 



/ ip o T n ■ ip dfi — / ipdfi ip d\x 
Jx Jx Jx 



<C- \\(p\\c?r(X) ■ IMIl* «"• 



with constants C < 00 and k < 1 depending only on (X,Z,T,g), for any 
measurable functions ip, ip : X — ► K such that ip is bounded and <p is ^-Holder 
continuous. 

This is clear that T is 1-regular because it is piecewise C . Nevertheless, it 
is known that the invariant density has discontinuities on d[T n (Z n )] . With 
a more intricate study, we should probably prove that / is also regular. 
Here, we restrict ourselves to the estimate of T. 

Corollary 3.10. Let T satisfy Asumption\^ let K be a ^-Holder Kernel, 
let Tn be the estimator of T . There exists M > 0, L > 0, R > 0, such that 
outside a set of measure less than Rh, for all t € M + , for all u > Cth, 

P(|T n (x) - T(x)\ > t - u) < 2e= exp[-t 2 L/i 7+2 n]. 
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As a consequence, provided h = h n goes to zero and nhZ = 0(n £ ), e > 0, 
we obtain the following convergences : 

• for m almost all lEl, T n (x) converges to T(x) almost surely and 
in IP for any 1 < p, 

E [ / \T„.(x) — T(x)\ ) go to zero. 



x 



• for almost all x € X, for a < \, \T n (x) — T(x)\ = Op(n a ). 

We have performed some simulations for T(x) = Bx mod Z 2 , with B the 
matrix 

'2.5 3.4 N 
^4.6 3.2 y 

Below are the histograms for dif f x (resp. diff y ), the difference beetween 
the x (resp. y) coordinate of T and the x (resp. y) coordinate of T n , over 
a grid of 100 times 100 points in [0, l] 2 . The Kernel is K = 4l[_i,i]x[-i,i]) 
there is no noise, e$ = 0, n = 66668 and h = 0.004. The AME for the 
coordinate x is 0.01882885, for the coordinate y, the AME is 0.06723186. 



0.2 0.4 0.6 0.8 

difference in x coordinate 



£T ^ 




0.2 0.4 0.6 0.8 

difference in y coordinate 



Remark (Anosov maps). Our technics should also apply to estimate the 
invariant density and the application T for Anosov maps for which there ex- 
ists an invariant measure absolutely continuous with respect to the Lebesgue 
measure. A more intricate study could also lead results of the same kind for 
Axiom A diffeomorphisms. 
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